A Supernodal Cholesky Factorization Algorithm for Shared-Memory Multiprocessors
نویسندگان
چکیده
This paper presents a new left-looking parallel sparse Cholesky fac,torization algorithm for shared-memory MIMD multiprocessors. The algorithm is particularly well-suited for vector supercomputers with multiple processors, such as the Cray Y-MP. The new algorithm uses supernodes in the Cholesky factor to improve performance by reducing indirect addressing and memory traffic. Earlier factorization algorithms have also used supernodes in this manner. The new algorithm, however, also uses supernodes to reduce the number of system synchronization calls, often by an order of magnitude or tnore in practice. Experimental results on a Sequent Balance 8000 and a Cray Y-MP demonstrate the effectiveness of thc new algorithm. On eight processors of a Cray Y-MP, the new routine performs the factorization at rates exceeding one Gflop for several test problems from the Harwell Boeing test collection, none of which are exceedingly large by current standards.
منابع مشابه
Supernodal Symbolic Cholesky Factorization on a Local-Memory Multiprocessor
In this paper, we consider the symbolic factorization step in computing the Cholesky factorization of a sparse symmetric positive definite matrix on distributedmemory multiprocessor systems. By exploiting the supernodal structure in the Cholesky factor, the performance of a previous parallel symbolic factorization algorithm is improved. Empirical tests demonstrate that there can be drastic redu...
متن کاملDirect and Incomplete Cholesky Factorizations with Static Supernodes
Introduction Incomplete factorizations of sparse symmetric positive definite (SSPD) matrices have been used to generate preconditioners for various iterative solvers. These solvers generally use preconditioners derived from the matrix system, , in order to reduce the total number of iterations until convergence. In this report, we investigate the findings of ref. [1] on their method for computi...
متن کاملAn Efficient Solver for Sparse Linear Systems Based on Rank-Structured Cholesky Factorization
Direct factorization methods for the solution of large, sparse linear systems that arise from PDE discretizations are robust, but typically show poor time and memory scalability for large systems. In this paper, we describe an efficient sparse, rank-structured Cholesky algorithm for solution of the positive definite linear system Ax = b when A comes from a discretized partial-differential equat...
متن کاملA New Recursive Implementation of Sparse Cholesky Factorization
Consider the Cholesky factorization of a sparse symmetric positive de nite matrix, A = LL . The rst two steps use symbolic, graph-theoretic techniques to order A to reduce ll in L, and to determine the exact sparsity structure of L. The factor L is computed in a third \numeric factorization" step. The two leading schemes for numeric factorization are a blocked column-oriented scheme, and a mult...
متن کاملRing-oriented Block Matrix Factorization Algorithms for Shared and Distributed Memory Architectures
Utilizing experiences from the implementations on shared memory multiprocessors (SMM) and distributed memory multicomputers (DMM), general ring-oriented routines are developed for the LU, Cholesky, and QR factorizations. Since, all machine dependencies are comprised to a small set of communication routines, the same factorization routines can be used on both the SMM and DMM architectures. The a...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- SIAM J. Scientific Computing
دوره 14 شماره
صفحات -
تاریخ انتشار 1993